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Minimum mean square error (MMSE) signal detection algorithm is near- 
optimal for uplink multi-user large-scale multiple input multiple output 
(MIMO) systems, but involves matrix inversion with high complexity. 
In this letter, we firstly prove that the MMSE filtering matrix for large- 
scale MIMO is symmetric positive definite, based on which we propose 
a low-complexity near-optimal signal detection algorithm by exploiting 
the Richardson method to avoid the matrix inversion. The complexity 
can be reduced from 0(K 3 ) to 0(K 2 ). where K is the number of 
users. We also provide the convergence proof of the proposed algorithm. 
Simulation results show that the proposed signal detection algorithm 
converges fast, and achieves the near-optimal performance of the classical 
MMSE algorithm. 


Introduction: Large-scale multiple input multiple output (MIMO) 
employing hundreds of antennas at the base station (BS) to simultaneously 
serve multiple users is a promising key technology for 5G wireless 
ly-yommunications (TJ. It can achieve orders of magnitude increase in 
.spectrum and energy efficiency, and one challenging issue to realize 
(^~$ uch goal is the low-complexity signal detection algorithm in the uplink, 
^\Jlue to the increased dimension of large-scale MIMO systems (5|. The 

_pptimal signal detection algorithm is the maximum likelihood (ML) 

i—algorithm, but its complexity increases exponentially with the number of 
|—transmit antennas, making it impossible for large-scale MIMO. The fixed- 
, -.complexity sphere decoding (FSD) 0 and tabu search (TS) 0 algorithms 
N ~4 iave been proposed with reduced complexity, but their complexity is 
uafordable when the dimension of the large-scale MIMO system is large 
the modulation order is high 0. Low-complexity linear detection 


^ p lporithms such as zero-forcing (ZF) and minimum mean square error 
^^MMSE) with near-optimal performance have been investigated 0, but 
^hese algorithms have to use unfavorable matrix inversion, whose high 
^mmplexity is still not acceptable for large-scale MIMO systems. Very 

i_recently, the Neumann series approximation algorithm has been proposed 

to approximate the matrix inversion |6j, which converts the matrix 
’“Inversion into a series of matrix-vector multiplications. However, only 
^marginal complexity reduction can be achieved, 
m In this letter, we propose a low-complexity near-optimal signal 
■^"detection algorithm by exploiting the Richardson method 0 to avoid 
'xfthe complicated matrix inversion. We firstly prove a special property of 
■^Jlarge-scale MIMO systems that the MMSE filtering matrix is symmetric 
C~ positive definite, based on which we propose to exploit the Richardson 
'method to avoid the complicated matrix inversion. Then we prove the 
^-^convergence of the proposed algorithm for any initial solution when the 
J^j'elaxation parameter is appropriate. Finally, we verify through simulations 
t hat the proposed signal detection algorithm can efficiently solve the matrix 
. jnversion problem in an iterative way until the desired accuracy is attained, 
^and achieve the near-optimal performance of the MMSE algorithm with 
fexact matrix inversion. 
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rge-Scale MIMO System Model: We consider a uplink multi-user 
arge-scale MIMO system which employs N antennas at the BS to 
simultaneously serve K single-antenna users. Usually we have N > K, 
e.g., N = 128 and K = 16 have been considered 0 0 - For signal 
detection, the complex-valued system model can be directly converted 
to a corresponding real-valued system model, then the estimate of the 
2 K x 1 transmitted signal vector s coming from K difference users can 
be achieved by the classical MMSE algorithm as 0 


s= (H H H + a 2 I 2K ) 1 H»y = W- 


y. 


0 ) 


where H is the 2N x 2 K MIMO channel matrix, which can be obtained 
through frequency-domain and/or time-domain training pilots 0, o 2 is 
the additive white Gaussian noise (AWGN) power, 1 2 x is an identity 
matrix of size 2K x 2K, y is the 2 N x 1 received signal vector at the BS, 
y = H H y can be interpreted as the matched-filter output of y, and finally 
W = H H H + tr 2 I 2 x denotes the MMSE filtering matrix. Note that the 
direct computation of the matrix inversion W -1 requires relatively high 
complexity of 0(K 3 ). 


Proposed Signal Detection Based on Richardson Method: Unlike the 
conventional (small-scale) MIMO systems with small number of antennas, 
large-scale MIMO systems have a special property that the MMSE filtering 
matrix W determined by the MIMO channel matrix H is symmetric 
positive definite, which can be proved as below. 

Lemma 1. For signal detection of large-scale MIMO systems, the MMSE 
filtering matrix W is symmetric positive definite. 

Proof: The column vectors of the channel matrix H in large-scale 
MIMO systems are asymptotically orthogonal (i.e., rank (H) = 2 K) 0 
Then we have the equation Hr = 0 when and only when r is a 2 1\ x 1 
zero vector. Thus, for an arbitrary non-zero 2 K x 1 vector r, we have 

(Hr) ff Hr = r ff (H H H)r>0, (2) 

which indicates that the Gram matrix G =H ff H is positive definite. In 
addition, as we have G H = (H ti H) h = G, so G is also symmetric. Thus, 
the Gram matrix G is symmetric positive definite. Finally, as the noise 
power <t 2 is positive, we can conclude that the MMSE filtering matrix 
W = H W H + o 2 I 2 k is symmetric positive definite matrix, too. □ 

The special property that the MMSE filtering matrix W in large-scale 
MIMO systems is symmetric positive definite inspires us to exploit the 
Richardson method [7j to efficiently solve (1) in an inversion-less way. 
The Richardson method is used to solve the ^-dimension linear equation 
Ax = b, where A is the N x N symmetric positive definite matrix, x is 
the N x 1 solution vector, and b is the N x 1 measurement vector. The 
Richardson iteration can be described as 

x( i+1 ) = x(’)+i»(b-Ax (i >), i = 0,1,2, (3) 

where the superscript i denotes the number of iterations, and w represents 
the relaxation parameter. Since W in (1) is also symmetric positive definite 
as proved above, we can exploit the Richardson method to estimate the 
transmitted signal vector s without matrix inversion as below: 

s (»+i) _ S W w ^y _ , i = 0,1, 2, • • • (4) 

where the initial solution s(°) can be usually set as a 2 K x 1 zero vector 
without loss of generality due to no priori information of the final solution 
is available [7], Such initial solution will not affect the convergence of 
the Richardson method, since the symmetric positive definite matrix W 
guarantees the convergence of the Richardson method for any initial 
solution as we will prove in the following Lemma 2. Consequently, the 
final accuracy will also not be affected by the initial solution if the 
number of iterations i is large (e.g., i = 5), as will be verified later in 
the simulation results. Since the relaxation parameter w in (4) plays an 
important role in convergence, next we will prove that the convergence 
of the Richardson method for any initial solution when the relaxation 
parameter is appropriately selected. 

Lemma 2 .For the N-dimension linear equation Ax = b. the necessary 
and sufficient conditions for convergence of the Richardson method is that 
the relaxation parameter w satisfies 0 < w < 2/Ai, where Ai is the largest 
eigenvalue of symmetric positive definite matrix A. 

Proof: We define D = Ijv — w A and c = uib, where D is the iteration 
matrix. Then the Richardson iteration (3) can be rewritten as 

x (i+1) =Dx {i) +c, i = 0,1,2, -■ ■ (5) 

We call the iteration procedure is convergent if lim sM = g and 

%—>-00 

s = Bs + c for any initial solution . 

The spectral radius of iteration matrix D &R NxN is the non¬ 
negative number p(D) = max |/r„(D)|, where /r„(D) denotes the nth 

l<n<N 

eigenvalue of D, and the necessary and sufficient conditions for the 
convergence of (5) is that the spectral radius should satisfy [7, Theorem 
7.2.2] 

p(D) = max \p, n (D) | < 1. (6) 

1 <n<N 

Without loss of generality, we use Ai > A 2 > • ■ ■ > Ajv > 0 to denote the 
N eigenvalues of symmetric positive definite matrix A, where Ai is the 
largest one. Because of D = I — wA, we have /r„(D) = 1 — w\ n , where 
A n is the nth eigenvalue of A. which can be substituted into (6), and then 
we have 0 < w < 2/Ai. □ 

Computational Complexity: The complexity in terms of required number 
of multiplications is analyzed for comparison. It can be found from (4) that 
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Table 1 : Computational Complexity 



Conventional Neumann 
series approximation (6) 

Proposed algorithm based 
on Richardson method 

i = 2 

12A' 2 - 4 K 

8 K 2 + 4 K 

i = 3 

8 K 3 + 4 K 2 - 2K 

12 K 2 + 6 K 

2 = 4 

16 K 3 - 4K 2 

16K 2 + 8 K 

4 = 5 

24K 3 - 12K 2 + 2K 

20K 2 + 10 K 


the 4th iteration of the proposed signal detection algorithm involves one 
multiplication of a 2K x 2K matrix W with a 2 K x 1 vector sW, and one 
multiplication of a constant relaxation parameter w with a 2K x 1 vector 
y — Wsf‘), thus the required number of multiplications is 4 K + 2K for 
each iteration. 

Table U compares the complexity of the conventional Neumann 
series approximation algorithm (6) and the proposed algorithm based on 
Richardson method. It is well known that the complexity of the classical 
MMSE algorithm is 0(K 3 ), and Table Q] shows that the conventional 
Neumann series approximation algorithm can reduce the complexity from 
0(K 3 ) to 0(K 2 ) when the number of iterations is 4 = 2. However, the 
complexity is 0(I\ 3 ) when 4 > 3. Since usually large value of 4 is required 
to ensure the final approximation performance (e.g., 4 = 5 as will be 
verified later by simulation results), the overall complexity is still 0(K 3 ), 
which indicates that only marginal complexity reduction can be achieved. 
On the contrary, the complexity of the proposed algorithm is reduced from 
0(K 3 ) to 0(K 2 ) for any arbitrary number of iterations. 

Simulation Results: The simulation results of the bit error rate (BER) 
perfonnance against the signal-to-noise ratio (SNR) are provided to 
compare the proposed signal detection algorithm with the recently 
proposed Neumann series approximation algorithm (6), whereby the SNR 
is defined at the receiver. The BER performance of the classical MMSE 
algorithm with complicated but exact matrix inversion is also included 
as the benchmark for comparison. We consider an N x K = 128 x 16 
large-scale MIMO system employing the modulation scheme of 64 QAM, 
and the rate-1/2 convolutional code with [133 0 171 0 ] polynomial together 
with a random interleaver. We adopt flat Rayleigh fading channel. At the 
receiver, the log-likelihood ratios (LLRs) are extracted from the detected 
signal for soft-input Viterbi decoding (9). Through intensive simulations, 
we find out that when N and K are fixed, the largest eigenvalue of the 
MMSE filtering matrix W is around a certain value, and accordingly the 
relaxation parameter is set as w = 0.00645 to guarantee the convergence. 
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achieved by the proposed signal detection algorithm. More importantly, 
when the number of iterations is moderately large (e.g., 4 = 5 in Fig. 1), the 
proposed algorithm without the complicated matrix inversion can achieve 
the near-optimal BER performance of the MMSE algorithm with exact 
matrix inversion. 

Conclusions: By fully exploiting the special property that the MMSE 
filtering matrix in large-scale MIMO systems is symmetric positive 
definite, we propose a low-complexity near-optimal signal detection 
algorithm based on the Richardson method to avoid the complicated 
matrix inversion, which can reduce the complexity from 0(K 3 ) to 
0(K 2 ). We also prove the convergence of the proposed algorithm for any 
initial solution when the relaxation parameter is appropriate. Simulation 
results verify that the proposed algorithm outperforms the conventional 
method, and achieves the near-optimal performance of the classical MMSE 
algorithm. 
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Fig. 1 BER performance comparison in an N X K = 128 X 16 large-scale 
MIMO system. 


Fig. 1 shows the BER performance comparison results, where i denotes 
the number of iterations. It is clear that the BER performance of both 
algorithms improves with the number of iterations, but the proposed 
algorithm outperforms the conventional one when the same number of 
iterations is used, which indicates that a faster convergence rate can be 
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